OPTIMAL ^-ESTIMATION OF A SPHERICAL LOCATION 



Christophe Ley 1 , Yvik Swan 2 , Baba Thiam 3 and Thomas Verdebout 3 

1 Departement de Mathematique, E.C.A.R.E.S., Univer site Libre de Bruxelles, Belgique 
2 UR en Mathematique, Universite du Luxembourg, Luxembourg 
3 Laboratoire EQUIPPE, Universite Lille Nord de France, France 



Abstract: In this paper, we provide 7i-estimators of the location of a rotationally symmetric dis- 
tribution on the unit sphere of R fe . In order to do so we first prove the local asymptotic normality 
property of a sequence of rotationally symmetric models; this is a non standard result due to the 
curved nature of the unit sphere. We then construct our estimators by adapting the Le Cam 
one-step methodology to spherical statistics and ranks. We show that they are asymptotically 
normal under any rotationally symmetric distribution and achieve the efficiency bound under a 
specific density. Their small sample behavior is studied via a Monte Carlo simulation and our 
methodology is illustrated on geological data. 
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1 Introduction 

Spherical data arise naturally in a broad range of natural sciences such as geology and astro- 
physics (see, e.g., Watson (1983) or Mardia and Jupp (2000)), as well as in studies of animal 
behavior (see Fisher et al. (1987)) or even in neuroscience (see Leong and Carlile (1998)). It 
is common practice to view such data as realizations of random vectors X taking values on 
the surface of the unit sphere S k ~ l := {v G M fc | ||v|| = 1}, the distribution of X depending 
only on its distance - in a sense to be made precise - from a fixed point € S k ~ 1 . This 
parameter 6, which can be viewed as a "north pole" (or "mean direction") for the problem 
under study, is then a location parameter for the distribution. 

The first distribution tailored to the specificities of spherical data is the Fisher- Von Mises- 
Langevin (FVML) distribution introduced in Fisher (1953). To this date it remains the 
distribution which is the most widely used in practice; it plays, in the spherical context, the 
central role enjoyed by the Gaussian distribution for linear data. The FVML is, obviously, not 
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the only so-called "spherical distribution" , and there exists a wide variety of families of such 
distributions possessing different advantages and drawbacks (see Mardia and Jupp (2000) for 
an overview). 

In this work we will concentrate our attention on the family of rotationally symmetric 
distributions introduced by Saw (1978) (see Section 2 below for a definition). Aside from the 
fact that it encompasses many well-known spherical distributions (including the FVML), this 
family satisfies a natural requirement : it is invariant to the actual choice of "north pole". 
This entails that the family falls within the much more general class of statistical group models 
(see for instance Chang (2004)) and thus enjoys all the advantages of this class. Moreover, it 
satisfies the following fundamental lemma, due to Watson (1983). 

Lemma 1.1 (Watson (1983)). If the distribution of X is rotationally symmetric on S k ~ l 
and if the true location is 6, then (i) X'0 and 

Sfl(X) := (X — (X'#)#)/||X — (X'^)^H are stochastically independent and (ii) the multivariate 
sign vector S#(X) is uniformly distributed on S k ~ 1 (0~ L ) := {v € R fc | ||v|| = 1, v'0 = 0}. 

Estimation and testing procedures for the spherical location parameter 6 have been ex- 
tensively studied in the literature, with much of the focus in the past years being put on the 
class of M-estimators. An M-estimator associated with a given function po(x;6) is defined 
as the value of 6 which minimizes the objective function 

d^p(6) :=^ Po (Xi;0), 

i=i 

where Xi, . . . ,X n are spherical observations. These M-estimators are robust to outliers (see 
Ko and Chang (1993)) and enjoy nice asymptotic properties (see Chang and Rivest (2001), 
Chang and Tsai (2003) or Chang (2004)). In particular, the choice po(x;6) = arccos(x'0) 
yields the so-called spherical median introduced by Fisher (1985); taking po( x ;0) = || x — 0\\ 2 
yields 6 = X/||X||, the spherical mean. 

The first to have studied ranks and rank-based methods within the spherical frame- 
work are Neeman and Chang (2001), who construct rank score statistics of the form := 
Yli=l V 7 {Rf/( n + 1)) Sfl(Xj), where ip is a score generating function and where Rf (i = 
1, . . . , n) stands for the rank of ||Xj — (X-0)0|| among the scalars ||Xi — (X]0)0||, . . . , ||X n — 
(X£j0)0||. Some years later, Tsai and Sen (2007) developed rank tests (using a similar defini- 
tion for the ranks Rf) for the location problem T-Lq : = 6q. Assuming that the observations 
Xi, . . . ,X n are independent and identically distributed (i.i.d.) with a rotationally symmetric 
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distribution on the unit sphere, they consider statistics of the form (T^ o )T T T^ q , where 

(n) 

Tx is the asymptotic variance of T under the null. They obtain the asymptotic properties 
of their procedures via permutational central limit theorems. 

The purpose of the present work is to propose optimal rank-based estimators (/^-estimators) 
for the spherical location 6. The backbone of our approach is the so-called "Le Cam method- 
ology" (see Le Cam (1986)), which has, to the best of our knowledge, never been used in 
the framework of spherical statistics. This is perhaps explained by the curved nature of the 
parameter space: the unit sphere S k ~ l being a non-linear manifold, it typically generates non- 
traditional Gaussian shift experiments and, as a consequence, the usual arguments behind Le 
Cam's theory break down in this context. 

The first step consists in establishing the uniform local asymptotic normality (ULAN) of 
a sequence of rotationally symmetric models. This we achieve by rewriting 6 G S k ~ 1 in terms 
of the usual spherical ^-coordinates and showing that ULAN holds for this more common re- 
parameterization. Although the latter parameterization is not valid uniformly on the whole 
sphere (its Jacobian is not full-rank everywhere), we then use a recent result of Hallin et al. 
(2010) to show how the ULAN result in the ^-parameterization carries through to the original 
^-parameterization. 

The second step consists in adapting to the spherical context the so-called method of 
one-step R-estimation "a la Le Cam", introduced in Hallin et al. (2006) in the context of 
the estimation of a shape matrix of a multivariate elliptical distribution. The ULAN prop- 
erty mentioned above guarantees that the resulting rank-based estimators enjoy the desired 
optimality properties. 

Our estimators constitute attractive alternatives to the traditional Hodges-Lehmann rank- 
based estimators which have, in general, two major drawbacks. First, they are defined through 
minimization of a rank-based function which is therefore non-continuous; this complicates 
greatly the study of their asymptotic properties. Secondly, as shown via a Monte Carlo study 
in Hallin et al. (2011) in a regression context, the performances of Hodges-Lehmann- type 
estimators tend to deteriorate as (the dimension) k increases. As will be shown in this paper, 
one-step /^-estimators suffer from neither of these flaws and are even good competitors against 
the M-estimators studied in the literature. 

The outline of the paper is as follows. In Section 2 we recall the definition of the family 
of rotationally symmetric distributions and prove uniform local asymptotic normality of this 
family. We devote Section 3 to a description of our rank-based estimators and to the study of 
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their asymptotic properties. In Section 4 we compare our /^-estimators with the M-estimators 
from the literature in terms of Asymptotic Relative Efficiency as well as by means of a Monte 
Carlo study. In Section 5 we apply our ^-estimators to geological data. Finally an Appendix 
collects the technical proofs. 



2 Spherical model and ULAN 

Throughout, the data points Xi, . . . , X n are assumed to belong to the unit sphere iS^" 1 of M. k 
and to satisfy the following assumption. 

Assumption A. Xi, . . . , X n are i.i.d. with common distribution characterized by a 
density function (with respect to the usual surface area measure on spheres) 

x^f g (x) = c kJl f 1 (x'0), xeS w , (2.1) 

where 6 G S k ~ 1 is a location parameter and f\ : [—1, 1] — > M.q is absolutely continuous and 
(strictly) monotone increasing. 

A function fx satisfying Assumption A will be called an angular function] we throughout 
denote by J- the set of angular functions. This choice of terminology reflects the fact that, 
under Assumption A, the distribution of X depends only on the angle between it and some 
location G S k ~ 1 . If Xi, . . . , X n are i.i.d. with density (2.1), then X^, . . . ,X'„0 are i.i.d. 
with density 

1 1 ^ f\(t) := "\^' h „ /i(t)(l - t 2 )^' 2 , -1< t < 1, 

where uj^ = 2ir k / 2 /T(k/2) is the surface area of S k ~ 1 and B(-,-) is the beta function. The 
corresponding cdf will be denoted by F\{€). 

A special instance of (2.1) is the FVML distribution, obtained by taking angular functions 
of the form fi(t) = exp(ref) =: /i-exp(^) for some concentration parameter k > 0. Aside from 
the FVML distribution we will also consider spherical distributions with angular functions 

/i;Lin(a)(*) ■= t + a, /i ; Lo g (a) (*) : = log(t + a) and (2.2) 

_ oexp(-barccos(t)) 
/i;Logta(o,6) ' (1 + a exp(-6 arccos(t))) 2 ' 1 j 

we refer to these as the linear, logarithmic and logistic spherical distributions, respectively (a 

and b are constants chosen so that all the above angular functions belong to J-). 
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The rest of this section is devoted to the establishment of the ULAN property of the 



comment just below Proposition 2.1 for a definition of ULAN. As mentioned in the Introduc- 
tion, in order to obtain this ULAN property we first need to circumvent a difficulty inherited 
from the curved nature of the experiment we are considering. 

Among the first to have considered such "curved experiments", Chang and Rivest (2001) 
and Chang (2004) suggest to bypass the problem by reformulating the notion of Fisher in- 
formation in terms of inner products on the tangent space to In this paper, we rather 
adopt an approach based on recent results from Hallin et al. (2010) and, in particular, on the 
following lemma. 

Lemma 2.1 (Hallin et al. (2010)). Consider a family of probability distributions = 
{p£ n) | u g n} with n an open subset of ~R kl (k\ G No,). Suppose that the parameterization 
u !-»• PL n) is ULAN for V^ n) at some point wo 6 with central sequence A^d and Fisher 
information matrix T Wo . Let a* : u> t— > i9 := c?(w) be a continuously differentiable mapping 
from M. kl to M fc2 (k% < &2 G No,) with full column rank Jacobian matrix Dol(u)) at every 
U) in some neighborhood of Uq. Write © := ct{Q,), and assume that t? i— > P^' d , •& G G, 
provides another parameterization of "p( n ) . Then i? i— > p^'^ is also ULAN for "p( n ) at i?o = 
d(u>o), with central sequence A^'^ = (D~(l(u}o))'A^ and Fisher information matrixT^ Q = 
(D-n(u ))T Uo D-n(u Q ), where £> _ 3(w ) := ((/^(wo))'^^))" 1 ^^))' is the Moore- 
Penrose inverse of Dcl(u)o). 

We start by establishing ULAN for a re-parameterization of the problem in terms of 
spherical coordinates. Any vector 6 on the unit sphere of M. k can be represented via the chart 



family {P ( 



,(n) 
OJi 



G S k 1 }, where Pg™^ stands for the joint distribution of Xi, . . . , X n ; see the 



h:rj:= (t/i 



. . . ,rjk-i)' G R fc i y h(rf) = 6 = (cos rji , sin rji cos 772 



(2.4) 



sin rji ■ ■ • sin r/ fc _ 2 cos r/ fc _i, sin 771 • • • sinr/ fc _ 2 sin 77^-1)', 



whose Jacobian matrix Dh{rj) is given by 
^ — sin 771 











\ 



cos 771 cos 772 



— sin 771 sin 772 
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cos 771 n 
cos 771 n 




-llj=i smT/j 
sin^cos 77 fc _! J 
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The corresponding ^-parameterization {Pi?£ I V £ is simple to construct and enjoys 

the advantage of a classical parameter space, namely This is very convenient for 

asymptotic calculations; in particular, it is helpful when quadratic mean differentiability and 
consequently ULAN have to be proved. 

Before proceeding to the statement of our results we need the following (essentially tech- 
nical) assumption. 

Assumption B. Letting ipf x := f\j f\ (f± is the a. e. -derivative of /i), the quantity 
Jk(fi) ■= I\ V% - t 2 )h (t)dt < +00. 

Assumption B entails that the Fisher information matrix for spherical location is finite (in both 
the rj- and the original ^-parameterization) . More precisely, we will show that the information 
matrix for the 77-parameterization with chart K is of the form 

^A D h{ri)'{l k - h{r,)h{ri)')Dh{n) 

with 1^ standing for the k x k identity matrix. This matrix can be rewritten as — „, 
where 

Qfc jJ? := diag(l,sin 2 ?/i,sin 2 r?isin 2 r/2,...,sin 2 7/i---sin 2 r/ fc _ 2 ) 
= Dh(rj)'DH(ri) 
= Dh{rj)'{l k - %)%)')£>%), 

with diag(ai, . . . , a;) denoting a diagonal matrix with diagonal elements ai, . . . , a\. Hence, if 
DH(rj) is full column rank, then the information matrix fijfc )}J is also full-rank. 

With this in hand we are able to establish the ULAN property of the family {P^ff \ f) £ 
R fc_1 } at any point r]Q £ M. k ~ l (note that, clearly, at some points the information matrix 
will be singular, due to the rank deficiency of the Jacobian matrix). In order to avoid heavy 
notations, we drop the index in what follows and write out the LAN property at 7/ £ R^ 1 
in the following proposition (see the Appendix for a proof). 

Proposition 2.1. Let Assumptions A and B hold. Then the family of probability distributions 
\^rr^h I ^ ^ K^'" 1 j is LAN with central sequence 

n 

A Sf := n- 1/2 ^^/ 1 (X^(r?))(l " Wh( V )) 2 )V 2 Dh(r,yS m (X t ) (2.5) 
i=i 
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(2.6) 




-(eW)1* A eW+op(l) 



(2.7) 




oo. 



In the sequel, we will make use of a slightly reinforced version of Proposition 2.1, namely 
the ULAN property of our sequence of models. A sequence of rotationally symmetric models 



r). In the present setup, it can easily be verified that ULAN holds by using the same arguments 
as in the proof of Proposition 2.1. 

Note that the diagonality of the Fisher information (2.6) is convenient when working in 
the ^-parameterization. Indeed it is the structural reason why, when performing asymptotic 
inference on r/i for example, the non-specification of the parameters rjz,... ,r}k-i will not be 
responsible for any loss of efficiency. Our focus being on the ^-parameterization, we will not 
investigate this issue any further. 

The ^-parameterization obtained via the chart h suffers from the drawback that the 
information matrix is singular at several points on the sphere. More precisely, if sin rji = 
for some rji £ {rji, . . . ,77^-2}, then Oj^ _ is singular. For example, in the 3-dimensional case, 
the parameter values 6 = (1,0')' or 6 = (—1,0')' are very particular; for those values, fi§„ = 
diag(l, 0). This phenomenon is due to an identification problem: in the general fc-dimensional 
case, when rji = for example, then Pjj ljtK) ... = P??i,^2,...,%_i;/i f° r an Y — 2)-uples 
(772, • • • , Vk-l) (fhi • • • 1 Vk-i)- in the same way, whenever rjj = for some j G {1, . . . , k — 2}, 
the Jacobian matrix Dh(rj) is not full-rank and as a consequence 1^.^ is singular. This 



is called ULAN if, for any r]^ n > such that = 0(n l l 2 ) and any bounded sequence 
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singularity - and the identification problem from which it results - is however not structural 
and only originates in the choice of the chart. 

Since every point of a m-dimensional manifold has a neighborhood homeomorphic to an 
open subset of the m-dimensional space W 71 , we see that, for all 9 G <S one can find a 
chart T : R fc_1 i— > S k " 1 : f] i— > = t(rfj with a full column rank Jacobian matrix Dt(rj) in the 
vicinity of rf. Making use of Proposition 2.1, whose proof does not involve the particular form 
of h, we obtain that the family {Pi?*^ | V £ IK^'" 1 } is ULAN with central sequence 

n 

Ag; r := n- 1 /2^ ¥ , /i(X ^(r ? ))(l - (X^%)) 2 ) 1 /2 jD r( 97 )'S^ ) (X i ) 

i=l 

and full-rank Fisher information matrix 

This shows that, for any 8 G it is possible to find a chart t such that, in a neighborhood 

of r) = t~ 1 (9), the Jacobian matrix is non-singular and the ULAN property for the related 
family jpj^^l V e j holds. This observation, combined with Lemma 2.1, finally yields 

the desired ULAN property for the family jpj^ | G S^ 1 } (see the Appendix for a proof). 

Let 0^ G S k_1 be such that 0^ — = 0(n~ 1 / 2 ) and consider local alternatives on the 
sphere of the form 0^ + ?i -1 / 2 t( n ). For 0^ + n _1/,2 t( n ) to remain in S k ~ 1 , it is necessary 
that the sequence t( n ) satisfies 

= («W + n-Wj'^W + n-VHW) - 1 

= 2n- 1 / 2 (0 {n) )'t ( -^ +n-\t^yt( n \ (2.8) 

Consequently, t( n ) must be such that 2n~ 1 / 2 (0 {n) )'t^ + n~ l (t^)' = or equivalently 
such that 2n- 1 / 2 (0 (n) )'t (n) + o(n~ 1 / 2 ) = 0. Therefore, for (n) + n' 1 / 2 ^ to remain in S k ~\ 
t( n ) must belong, up to a o(n -1 / 2 ) quantity, to the tangent space to tS fe_1 at 8^ n \ 

Now, Lemma 2.1 provides a link between the ULAN properties of two different param- 
eterizations of the same model. This link is directly related to the link between the lo- 
cal alternatives + ra -1//2 e( n ) in the full-rank parameterization and the local alternatives 
8^ + ra^ 1 / 2 ^™) described above. As shown in (2.8), in order for 0^ + n~ l / 2 t^ to belong 
to S k ~ 1 , it is necessary that t( n ) be of the form t* + o(l), with t* in the tangent space to 
tS^™ 1 at 0^ n \ hence of the form t#* + o(l) with t**^ in the tangent space to S k ~ l at 8 that 
is. = Dt(fj)eW for some bounded sequence e( n ) G M k 1 . It follows from differentiability 
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that, letting rj^ = r 1 ^), 

0(n) +n -l/2 t (n) = 0(n) +n -l/2 Dt{r))e (n) +o{n ^l/2 ) 

= + n- 1 / 2 -D%)eW + o(n" 1 / 2 ) 

= +n- 1 / 2 e( n ) + o(n" 1/2 )), (2.9) 

hence there is a clear correspondence between linear perturbations in the ^-parameterization 
and perturbations on the sphere in the ^-parameterization (through the chart T). As a di- 
rect consequence, the local alternatives 6^ + ra _1 / 2 t( n ) are equivalent to those considered 
in Tsai (2009). Now, turning to local log-likelihood ratios, in view of ULAN for the rj- 
parameterization, 

loe(dP (n) ldP {n) \ ~ loefdP (n) /dP {n) 

= eW'AW^-ie(™)T5 ;/l eW+op(l) 

(n) (n) 

under P^ ( „).^ = P^(»j ^ -probability, as n — > oo. Summing up, we have the following result. 

Proposition 2.2. Let Assumptions A and B hold. Then the family of probability distributions 
{p^ | 6 is ULAN with central sequence 

n 
i=l 

and Fisher information matrix 

More precisely, for any 6^ 6 suc/i i/ioi 0^ — 6 = 0{n~ 1 / 2 ) and any bounded sequence 

t( n ) as in (2.8), we have 

'dP (n) \ 

log | = (tW/Ag, - i(tW)V (B) +op(i) 



and ^l). fl ~^ M=-i(O,r 0;/l ), &oi/i under P^.^j as n ~> 00 • 



See the proof in the appendix in which the expressions of A^.^ and Tg.j 1} are obtained. 



9 



3 Rank-based estimation: optimal ^-estimators 

In this section, we make use of the ULAN property of Proposition 2.2 to construct semi- 
parametrically efficient /^-estimators of 0. To this end, we adapt the Le Cam technique of 
one-step iZ-estimation, introduced in Hallin et al. (2006), to the present context. 

Le Cam's one-step /^-estimation method assumes the existence of a preliminary estimator 
of satisfying some conditions, which are summarized in the following assumption. 

Assumption C. The preliminary estimator G S k ~ l is such that 

(i) 6-0 = O v {n-V*) under U^Pgf 

(ii) 6 is locally and asymptotically discrete; that is, it only takes a bounded number of distinct 
values in ^-centered balls with 0{n~ 1 / 2 ) radius. 



Assumption C(i) requires that the preliminary estimator is root-n consistent under the 
whole set T of angular functions. This condition is fulfilled by, e.g., the spherical mean or 
the spherical median. This uniformity in g\ £ T plays an important role for our i?-estimation 
procedures, as we shall see in the sequel. Regarding Assumption C(ii), it should be noted that 
this discretization condition is a purely technical requirement (see, e.g., Hallin et al. (2011)), 
with little practical implications (in fixed-n practice, such discretizations are irrelevant as the 
discretization radius can be taken arbitrarily large). Therefore, for the sake of simplicity, we 
tacitly assume in the sequel that satisfies Assumption C(ii). 

3.1 Rank-based central sequence and its asymptotic properties 

The main idea behind one-step /^-estimation consists in adding to the preliminary estimator 

a rank-based quantity which provides optimality under a fixed density. This rank-based 

quantity will appear through a rank-based version A^. (rigorously defined in (3.2) below) 
of the parametric central sequence obtained in Proposition 2.2. A natural requirement for 

our rank-based central sequence A^- is its distribution- freeness under the broadest possible 

family of distributions, say Pl"oad" ^ c l ass i ca l wa Y to achieve this goal consists in having 

(n) 

recourse to the so-called invariance principle which, when P^-oad * s i n va riant under a group 
of transformations, recommends expressing Al n L in terms of the corresponding maximal 
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invariant. This entails that if, furthermore, this group of transformations is a generating 

group for P^oad' then is distribution-free under P^oad 1 

Clearly, in this rotationally symmetric context, invariance with respect to rotations is 

crucial, and therefore it seems at first sight natural to study P^ oad = Uses*- 1 f° r fi xe d 
g\ € F under the effect of the group of rotations. This group is generating for {J eeS k-i Pg^j 
and the ranks Rf defined in the Introduction are rotationally invariant because the scalar 
products X-0 are invariant with respect to rotations. This is not the case of the multivariate 
signs Se(Xj), which are only rotationally equivariant in the sense that Sg(OXj) = OSg(Xj) 
for any O G 50^, the class of all k x k orthogonal matrices. However, since our aim con- 
sists in estimating 0, we rather work under fixed-0 assumptions, hence the above-mentioned 
family P[,™o ad makes little sense in the present context. Furthermore, when inference on is 
considered, the angular function g\ remains an infinite-dimensional nuisance, which indicates 
that statistics which are invariant and therefore distribution- free under U^g^Pe^ fixed) 
should in fact be taken into account, if possible without losing invariance properties with 
respect to the group of rotations. 

Fix e S^ 1 and consider P^ ad = U gi6 .F P 2r We obviously have that X; = (X^0)0 + 
yT — (X^) 2 Sg(Xj) by definition of the multivariate signs. Now, let be the group of 
transformations of the form g^' : (Xi, . . . , X„) h-» (^(Xi), . . . , <7/ l (X n )) with 



g h (Xi) :=MX^)0 + ^l-fc(Xp) 2 S,(Xi), i = l,...,n, (3.1) 

where h : [—1, 1] —> [—1, 1] is a monotone continuous nondecreasing function such that h(l) = 
1 and h(—l) = —1. For any g^ £ 5i , it is easy to verify that ||<7j-"^(Xj)|| = 1; this 
means that g^ G 0^ is a monotone transformation from (5 fc_1 ) n to (5 fc_1 )™. It is quite 

(n) 

straightforward to see that the group Q h is a generating group of the family of distributions 
I I P (n) 

These considerations naturally raise the question of finding the maximal invariant asso- 
ciated with . Note that the definition of the mapping in (3.1), together with the fact 
that 0'Sfl(Xj) = 0, entails that 



g h (Xi) - {g h {yLi)'0)0 = Jl - h(X^) 2 S fl (Xi). 
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Then, we readily obtain that, for i = 1 



n 




g fe (X t ) - {g h {Xi) f e)e 
9h (X 4 ) - G7 h (XO'0)0 



This shows that the vector of signs S#(Xi), . . . , S#(X n ) is invariant under the action of Gf, ■ 
However, it is not a maximal invariant; indeed, the latter is in most semiparametric setups 
composed of signs and ranks (see for instance Hallin and Paindaveine (2006)). Now, the ranks 
Rf are not invariant under the action of : one can easily find a monotone transformation 
g^ as in (3.1) such that the vector of ranks Rf computed from Xi, . . . , X n differs from the 
vector of ranks Rf computed from ^(Xi), . . . , <^(X n ). Thus, we need a different concept of 
ranks in order to build a rank-based version of our central sequence. For this purpose, define, 
for all i = 1, . . . , n, Ri as the rank of X^0 among X^0, . . . , X^0. Since in general, the ranks are 
invariant with respect to monotone transformations, noting that gh(X-i)'0 = /i(X-0) directly 

(ri) 

entails the invariance of these new ranks under the action of the group Q h . The maximal 
invariant associated with is, therefore, the vector of signs S#(Xi), . . . , S#(X n ) and ranks 
Ri, . . . , R n . 

While it is preferable to use the ranks Ri when invariance with respect to is required, 
there exist situations in which the ranks Rf are appealing, e.g. when the angular functions are 
of the form f\(t) = exp(g(t)) with g(—t) = —g(t) (since then the score function ^^(t) = g(t) 
in (2.5) is symmetric), see Tsai and Sen (2007). 

As explained above, it follows that any statistic measurable with respect to the signs 
Sfl(Xj) and ranks Ri is distribution- free under UgieJ"^^- ^ n accordance with these findings, 
we choose to base our inference procedures on the following sign- and rank-based version of 
the parametric central sequence obtained in Proposition 2.2: 



where K is a score function satisfying 

Assumption D. The score function K is a continuous function from [0, 1] to R. 
Note that all the score functions associated with the densities in (2.3) satisfy this assumption. 




(3.2) 
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In the following result (the proof is given in the Appendix), we derive the asymptotic 

properties of under and under contiguous alternatives Pj^_ n -i/2 t (n). (where t( n ) 

is a bounded sequence as described in (2.8)) for some angular function g\ G T . 

Proposition 3.1. Let Assumptions A, B and D hold. Then the rank-based central sequence 



(") 

( n ) aW m „„^dW 



(i) is such that A^.^- — ^- (/[ = op(l) under P#. 9l as n ^ oo, where (G\ 
stands for the common cdf of the 's under 'Pg!'g 1 ) 

n 



i=l 



tfence, /or K(tt) = ^(u) = ^ (Ff 1 (u))(l - (Ff 1 ^)) 2 ) 1 / 2 , Ag. is asymptotically 
equivalent to the efficient central sequence A^"^ for spherical location under Pg?^ • 



(ii) is asymptotically normal under with mean zero and covariance matrix Tq 



Jk(K) 



.„ a ... r a . . gi 

P(I fc - 66') , where J k (K) := K 2 (u)du. 



K 



(Hi) is asymptotically normal under ^Pg^_ n -i/2 t (n). g with meanTg ; x gi t (where t := linin^oo t( n ) ) 
and covariance matrix Tg-x, where, for J^[K,g\) := J K(u)K gi (u)du, 

J k {K, gi) , 



(iv) satisfies, under Pg\ gi as n oo, £/ie asymptotic linearity property 
/or any bounded sequence as described in (2.8). 

The main point of this proposition is part (iv); the preceding three results are necessary 

for proving that last part. However, they have an interest per se as they give the asymptotic 

properties of the rank-based central sequence A^-. Moreover, these results happen to be 
important when the focus lies on testing procedures for the location parameter 6. This is 

part of ongoing research. For the present paper, we are mainly interested in the asymptotic 

linearity property of Al n ;L more precisely on the asymptotic linearity property of 
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where Ri, i = 1, . . . , n, stands for the rank of among X^0, . . . , X^0. It is precisely here 
that Assumption C comes into play: it ensures that the asymptotic linearity property of 
Proposition 3.1(iv) holds after replacement of t^ n ) by the random quantity n 1//2 (0 — 6) (this 
can be seen via Lemma 4.4 of Kreiss (1987)), which eventually entails that 

Ag. - Ag» = -T e .^ ai n 1 /\e-e)+o P (l) (3.3) 

as n — > oo under Pg^ for g\ € T . We draw the reader's attention to the fact that (2.8) is 
satisfied when is replaced by n l l 2 {6 — 6). 

3.2 Properties of our i?-estimators 

Keeping the notation A~ for the Moore-Penrose inverse of some matrix A, let 



= « + n -'^i^iL(I t -W')A<">. 

Jk{K,gi) ~ e ' K 

= g + n-V 2 J*" 1 ) AW, 

MK, 9l ) ~0;K' 

where the last inequality holds since 6 Sx(Xj) = for all i = l,...,n. The expression 
of K\j k {K,gx) i s quite traditional when one-step estimation is considered. However, using 
K-,j k (K,g{) itself to estimate 9 is clearly unnatural since 0K;j k (K,gi) does not belong to S k ~ 1 
in general. This is why we propose the one-step i?-estimator 

0K;J k (K, 91 ) ■= 6K;J k (K, gi )/pK;J k (K, gi )\\ G S k ^ 1 

which is a normalized version of &K;j k (K,gi)- As it is shown in the sequel, the normalization 
has no asymptotic cost on the efficiency of the one-step method. 

Nevertheless, 0K;j k (K. gi ) is n °t a genuine estimator because it is still a function of the 
unknown scalar Jk(K,g\). This cross- information quantity requires to be consistently esti- 
mated in order to ensure asymptotic normality of our i?-estimators. To tackle this problem, 
we adopt here the idea developed in Hallin et al. (2006) still based on the ULAN property 
of the model. Define 6(8) := 6 + n~ 1 / 2 B(k - 1) A^l, 6(B) := 6(8)/\\6(8)\\ and consider the 
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quadratic form 



p^h^(B) ■= (A (n) )Tr A ( ™\ 



= ^_i)(A (n) yA( n) . (3.4) 
J k (K)~6;K J ~0<j));K y ' 

Then, part (iv) of Proposition 3.1 and the root-n consistency of 6{(3) (which follows from the 
root-re consistency of 6{f3) and the Delta method applied to the mapping x i— > x/||x||) imply 
that, after some direct computations involving (3.3), 

4^-4S = -r»* sl n" 2 (W)-«) + op(i) 

as n — y oo under P^. Moreover, it is clear that T . Ksi n 1/2 {6{(3) -6) = Tg. K n 1/2 {6{P) - 
0) + °p(1) as re — )• oo under P^"^- These facts combined with the definition of 0((3) entail 
that, under Pj, and for n — > oo, 

hi " ,{0) - Im ( -& ) '(-&- i '^" 1/2( * ot "* ) ) + ° p(1) 

- Iw ( -&'' " + op(1) 

where the passage from the first to the second line requires some computations involving 
again the Delta method, but for the sake of readability we dispense the reader from these 
calculatory details here; they follow along the same lines as those achieved below. In view of 
(3.4), h^ n \l3) can be rewritten as 

hWtf) = (1 - J k (K, 9l )P)hW(Q) + op(l) 

under Pg^ as n — > oo. Since h^ n \0) > 0, one obtains (the proof is along the same lines 
as in Hallin et al. (2006)) a consistent estimator of {jJ k {K, gi))" 1 given by (3 := inf{/3 > 
: /i( n )(/3) < 0}. Therefore, Jk(K,gi) := provides a consistent estimator of the cross- 
information quantity, and a genuine estimator of is provided by 6 ' K .j k ( K gi y Now, the Delta 
method and some easy computations show that under Pg!^ and for n — > oo 

^(i k -ee')(~e K;MKjgi) -e)+oAi), (3.5) 



n 
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where I& — 66' is the Jacobian matrix of the mapping x i— > x/||x|| (as already mentioned, 
the above-used Delta method works similarly) evaluated at 6. Then, (3.5), the definition of 
6 K .j k ( K g \ , part (iv) of Proposition 3.1, the consistency of Jk{K, g\) together with Assumption 
C entail that, under Pi n ' and as n — > oo, 

nl/2 VK;j ki K, 9l )-0) = ^k-e6')(6 K . MKm) -6]+o^{l) 

n 



ni k -66')(6 + n-^ Kg A^-6) + Ml) 
(I k - 66') (nV\6 - J)+r^AW - n^(6 - 6)) + o P (l) 

J^1LaW+pp(1), (3.6) 



Jk(K, 9l ] 

where the passage from the second to the third line draws upon (3.3). Wrapping up, we obtain 
the following result which summarizes the asymptotic properties of ® K-j k {K g x y 

Proposition 3.2. Let Assumptions A, B, C and D hold. Then, 

(i) under Pg!^, n l / 2 {6 K .j^ K s —6) is asymptotically normal with mean and covariance 
matrix (k - l)^gy (I* - 66') ; 

(ii) for K{u) = Kf t (u) = Lpf 1 (F 1 ~ 1 (u))(l — (F^ l {u)) 2 ) 1 / 2 , the estimator 6 K .j k i K ^\ is semi- 
parametrically efficient under Pg!^ • 

While part (i) of Proposition 3.2 is a direct consequence of (3.6), part (ii) requires more 
explanations. From (3.6), we obviously have that under Pg?^ 

as n — > oo. Now, using the identities Jk{Kf i: f\) = Jk{fi) and £^ Kf /i = ^9™/i> we obtain 
via part (i) of Proposition 3.1 that 

(fc ~ 1) A (n) (fe ~ 1) a (n) , nm /oon 
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under P^ as n — > oo. Combining (3.7) and (3.8) (and using again the fact that 0'Sg(X) = 0), 
it follows that 

still under P^?^ as n — > oo. This entails that K .j k (x is asymptotically efficient under 

In) 1 1 

Pg.ij which then finally yields the desired optimality properties. 

Finally note that if the preliminary estimator is rotation-equivariant, meaning that 
0(OX*) = O0(X;) for any matrix O G <S0 fc , then ^.^^(OX;) = O0 K . Jk{Kgi) (Xi). 
This implies that rotation-equivariance of our .R-estimators is inherited from the preliminary 
estimator. 



4 Asymptotic relative efficiencies and simulation results 

In this section, we compare the asymptotic and finite-sample performances of the proposed 
/^-estimators with those of the spherical mean and spherical median. The asymptotic perfor- 
mances are analyzed in Section 4.1 on basis of asymptotic relative efficiencies (ARE), while 
the finite-sample behavior is investigated in Section 4.2 by means of a Monte Carlo study. 

4.1 Asymptotic relative efficiencies 

As already mentioned in the Introduction, the traditional estimators of 6 in the literature are 
M-estimators. Such an M -estimator is defined, for a given function po(x;0), as the value of 
which minimizes the objective function 

n 

0^p{0) :=^ Po (Xi;0), 

i=i 

where Xj, . . . ,X n are spherical observations. Letting po(x;0) =: p(x'0), special instances of 
the above are the spherical mean #Mcan (maximum likelihood estimator under FVML distribu- 
tions, obtained by taking ip(t) := — p(t) = 2) and the spherical median ^Median (Fisher (1985), 
obtained by taking ip{t) = (1 — t 2 ) -1 ' 2 ). By Theorem 3.2 in Chang (2004), an M-estimator 
0M associated with the objective function p{x'0) is such that n 1 / 2 (^/ — 0) is asymptotically 
normal with mean zero and covariance matrix 

E[y, 2 (x'fl)(i-(x'0) 2 )] 

[ " ij E 2 [^X'0)MX'0)(l-(X'0) 2 )] [ ' 



17 



under P^ (expectations are taken under P^)- Now, let d\ and 62 be two estimators of 
the spherical location such that n}l 2 (6\ — 6) and n 1 / 2 ^ — 6) are asymptotically normal with 
mean zero and covariance matrices pi(I — 6d') and p2(I — 66 f ), respectively, under P^- Then, 



a natural way to compare the asymptotic efficiencies of 0\ and 62 (still under Pg^) is through 
the ratio AKEg. gi (61/62) = P1/P2 which we refer to as an ARE. The following result provides 
a general formula for ARE between an /^-estimator and an M-estimator. 

Proposition 4.1. Let 6 K .j k r K gi \ be the R-estimator associated with the score function K 
and let 6m be the M-estimator associated with the objective function p(x'0). Then 

ARF ra lh \ ni?(x , m-(m 2 )]J2(K,gi) 

K^8, gi [p K . MKgi) IV M ) - E 2ty(X'6)<p gi (X>0)(l - (X'0) 2 )]Jfc(#)' 

where AREg ;9l denotes the asymptotic relative efficiency under Pg!^ ■ 

In Tables 1 and 2, we collect numerical values of AREg ;gl for k = 3 under various un- 
derlying rotationally symmetric densities (namely those described right after Assumption A 
in Section 2). Several one-step i?-estimators are compared to both the spherical mean and 
the spherical median: $fvml(2) an d ^fvml(6)> based on FVML scores (with k = 2 and n = 6, 
respectively), #Lin(2) an d $Lin(4) based on linear scores (associated with linear angular densities 
with a = 2 and a = 4, respectively, see (2.2)), #Log(2.5) based on logarithmic scores (associ- 
ated with a logarithmic angular density with a = 2.5, see (2.2)) and finally #Logis(i,i) an d 
$Logis(2,i) based on logistic scores (associated with logistic angular densities with respectively 
a = 1, 6=1 and a = 2, 6 = 1, see (2.3)). 

Inspection of Tables 1 and 2 confirms the theoretical results obtained previously. When 
based on the score function associated with the underlying density, i2-estimators are optimal. 
For example, ^Lin(2) is the most precise estimator under P^ v ■ As expected the spherical 
mean dominates the i?-estimators under FVML densities since it is the maximum likelihood 
estimator in this case. For example, #fvml(2) is only just less or equally (under the FVML(2) 
density) efficient than the spherical mean under FVML densities but performs nicely under 
other densities. In general, the proposed i?-estimators outperform the spherical median. 

4.2 Monte Carlo study 

We now discuss the finite-sample behavior of different ii-estimators of the spherical location. 
For this purpose, we have generated M = 1000 samples from various 3-dimensional (k = 3) 
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AREs 


with respect to the spherical mean 


I AREffl it 


/ v Mean / / 




Underlying density 


^FVML(2) 


^FVML(E) 


*Lin(2) 


#Lin(4) 


8 Log(2.5) 


e Logia(l,l) 


e Logis(2,l) 


^Sqtl.l) 


FVML(l) 


0.9744 


0.8787 


0.9813 


0.9979 


0.9027 


0.9321 


0.7364 


0.7804 


FVML(2) 


1 


0.9556 


0.9978 


0.9586 


0.9749 


0.9823 


0.8480 


0.8932 


FVML(6) 


0.9555 


1 


0.9381 


0.8517 


0.9768 


0.9911 


0.9280 


0.9771 


Lin(2) 


1.0539 


0.9909 


1.0562 


1.0215 


1.0212 


1.0247 


0.8796 


0.9174 


Lin(4) 


0.9709 


0.8627 


0.9795 


1.0128 


0.8856 


0.9231 


0.7097 


0.7083 


Log(2.5) 


1.1610 


1.1633 


1.1514 


1.0413 


1.1908 


1.1625 


1.0951 


1.1376 


Log(4) 


1.0182 


0.9216 


1.0261 


1.0347 


0.9503 


0.9741 


0.7851 


0.8226 


Logis(l,l) 


1.0768 


1.0865 


1.0635 


0.9991 


1.0701 


1.0962 


0.9778 


1.0349 


Logis(2,l) 


1.3182 


1.4426 


1.2946 


1.0893 


1.4294 


1.3865 


1.5544 


1.4680 


Sq(l.l) 


1.2303 


1.3460 


1.1964 


1.0264 


1.3158 


1.3004 


1.3009 


1.3774 



Table 1: Asymptotic relative efficiencies of ii-estimators with respect to the spherical mean under various 
3-dimensional rotationally symmetric densities. 







AREs w 


th respect to the spherical median (AT{E(Bk /@ 


Median)) 




Underlying density 


^FVML(2) 


^FVML(S) 


L in(2) 


0Lin(4) 


8 Log(2.5) 


e Logis(l,l) 


e Logis(2,l) 


*Sq(l.l) 


FVML(l) 


1.0691 


0.9641 


1.0766 


1.0949 


0.9904 


1.0226 


0.8079 


0.8562 


FVML(2) 


1.1321 


1.0819 


1.1297 


1.0853 


1.1038 


1.1121 


0.9601 


1.0112 


FVML(6) 


1.1632 


1.2174 


1.1421 


1.0369 


1.1891 


1.2065 


1.1297 


1.1895 


Lin(2) 


1.1391 


1.0709 


1.1415 


1.1041 


1.1037 


1.1075 


0.9507 


0.9915 


Lin(4) 


1.0493 


0.9324 


1.0587 


1.0946 


0.9571 


0.9977 


0.7671 


0.8157 


Log(2.5) 


1.2171 


1.2195 


1.2071 


1.0917 


1.2484 


1.2188 


1.1481 


1.1926 


Log(4) 


1.0900 


0.9865 


1.0984 


1.1076 


1.0173 


1.0427 


0.8403 


1.0228 


Logis(l,l) 


1.1264 


1.1365 


1.1125 


1.0451 


1.1194 


1.1467 


1.0228 


1.0826 


Logis(2,l) 


1.4497 


1.5868 


1.4238 


1.1980 


1.5721 


1.5249 


1.7095 


1.6145 


Sq(l.l) 


1.2282 


1.3436 


1.1943 


1.0246 


1.3135 


1.2981 


1.2986 


1.3750 



Table 2: Asymptotic relative efficiencies of ii-estimators with respect to the spherical median under various 
3-dimensional rotationally symmetric densities. 

rotationally symmetric distributions: (i) the FVML(2) and FVML(4) distributions, (ii) the 
linear distribution with a = 2 and a = 4 (see (2.2)) and (hi) the square root distribution 
Sq(l.l) associated with an angular density of the form 

:= VtTa, with a = 1.1. 

The true location parameter is = (v2/2, y/2/2, 0)'. For each replication, the spherical 
median ^Median; the spherical mean Ouean, the FVML-score based i?-estimators #fvml(2) an d 
0fvml(4)> the linear-score based i?-estimators dun(2) an d 0Lin(4) an d finally the Sq(l.l)-score 
based /^-estimator ^sq(i.i) have been computed. The preliminary estimators used in the 
construction of all our one-step iZ-estimators are ^Median and 0Mcan, respectively. 

In Table 3, we report the Euclidean norm of m = (7711,7712,7713)' where, letting 9^ stand 
for the ith component of an estimator computed from the jth replication and 9i for the ith. 
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component of 0, 

M 

m * := ME(P-^ i = 1 ' 2 ' 3 ' 

3=1 

is computed still for each of the aforementioned estimators and sample sizes n = 100, n = 500 
and n = 1000. Simulation results mostly confirm the ARE rankings. Under all the dis- 
tributions considered, the optimality of the i?-estimators based on correctly specified den- 
sities is verified. In the FVML case, the spherical mean is efficient as expected, but the 
i?-estimators based on FVML scores are reasonable competitors. In general, the efficiency of 
our /^-estimators becomes better with respect to the spherical mean under departures from 
the FVML case, especially under the Sq(l.l) density. The spherical median is clearly domi- 
nated by the other estimators. The results also show that, as n increases, the influence of the 
choice of the preliminary estimator wanes; this confirms the fact that the asymptotic behavior 
of the /^-estimators does not depend on this choice (see Assumption C). 



5 A real-data application 

We now apply our i?-estimators on a real-data example. The data consists of 26 measurements 
of magnetic remanence made on samples collected from Palaeozoic red-beds in Argentina, 
and has already been used in Embleton (1970) and Fisher et al. (1987). The purpose of the 
study is to determine the origin of natural remanent magnetization in red-beds. While it is 
reasonable to assume that the underlying distribution associated with this data is unimodal 
(because a single component of magnetization is present, see Fisher et al. (1987)), a complete 
specification of the underlying distribution can not be justified, and hence semi-parametric 
methods are required. 

First, we computed the spherical mean M can = (.3187373, .4924234, - .8098924)' and the 
spherical median ^Median = (.3129147, .486036, —.8159984)'. On basis of this, we provide a 
histogram of the cosines X'^mcan, . . . , X' 26 <? m ean; see Figure 1 (the histogram obtained from 
the cosines X^$M ec iian5 • • • , X^g^Mcdian has a very similar shape). A visual inspection of the 
histogram pleads in favor of a FVML score-based i?-estimation (the black line represents a 
Gaussian kernel density estimator). Now, even if the FVML family is the target family of 
densities, the concentration parameter has to be chosen to perform our one-step /^-estimation. 
The FVML maximum likelihood estimator of k is given by kmle — 69.544 (see, e.g., Ko 
(1992)). 
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1 

amp c 
















Preliminary 


estimator 


fl Mean 


^Median 


*Mean 


fl Median 


°Mcan 


^Median 


Actual density 


Estimators 
















fl FVML(2) 


.12846 


.12813 


.00528 


.00528 


.00135 


.00134 




fl FVML(4) 


.13724 


.13779 


.00550 


.00551 


.00143 


.00142 




«Li„(2) 


.12818 


.12836 


.00531 


.00532 


.00135 


.00134 


FVML (2) 


8 Li„(4) 


.13540 


.13568 


.00576 


.00577 


.00147 


.00146 




fsq(l.l) 


.16916 


.17003 


.00647 


.00648 


.00170 


.00170 




*Mcan 


.12686 




.00528 




.00135 






^Median 




.16649 




.00687 




.00172 




d FVML(2) 


.01815 


.01810 


.00072 


.00072 


.00019 


.00018 




fl FVML(4) 


.01756 


.01757 


.00069 


.00069 


.00018 


.00017 




<>Lin(2) 


.01852 


.01849 


.00075 


.00074 


.00019 


.00019 


FVML (4) 


»Lin(4) 


.02158 


.02160 


.00087 


.00087 


.00022 


.00022 




»Sq(l.l) 


.01898 


.01909 


.00074 


.00074 


.00019 


.00018 




?Mcan 






.00069 




.00018 






^Median 




.02481 




.00098 




.00023 




fl FVML(2) 


16.40147 


15.88246 


.80516 


.80281 


.20099 


.20068 




fl FVML(4) 


17.61341 


17.41290 


.85555 


.85874 


.21485 


.21566 




<>Li„(2) 


16.27424 


16.73361 


.80201 


.80703 


.20081 


.20137 


Lin (2) 


»Lin(4) 


16.13449 


16.64150 


.85463 


.85917 


.21771 


.21827 




9 Sq(l.l) 


20.41046 


20.14262 


1.05501 


1.06998 


.25629 


.25712 




?Mcan 


16.62219 




.89528 




.22810 






^Median 




19.16783 




1.06145 




.26050 




fl FVML(2) 


201.7671 


186.5235 


16.30444 


15.85266 


3.75891 


3.73754 




fl FVML(4) 


212.7127 


212.5497 


18.71233 


18.35010 


4.35718 


4.46196 




<>Lin(2) 


199.7272 


203.3786 


15.94977 


16.21466 


3.68828 


3.71774 


Lin (4) 


*Lm(4) 


187.5686 


195.9556 


14.23143 


14.57779 


3.38679 


3.42702 




?Sq(l.l) 


231.3489 


229.2380 


22.61492 


.23.40273 


5.64923 


5.92582 




f Mean 


186.9112 




14.26441 




3.43922 






^Median 




195.7693 




17.05978 




3.94507 




fl FVML(2) 


13.28350 


13.13711 


.50958 


.50620 


.12659 


.12613 




*FVML(4) 


13.05349 


13.54705 


.44770 


.44563 


.10990 


.11025 




<>Lin(2) 


13.51480 


13.93615 


.53481 


.53583 


.13304 


.13324 


Sq (1.1) 


9 Lin(4) 


15.82969 


16.31837 


.71788 


.72184 


.18107 


.18156 




'sq(l.l) 


13.57166 


14.35157 


.42719 


.42348 


.10430 


.10487 




*Moan 


16.92412 




.77354 




.19337 






^Median 




18.55458 




.78308 




.19223 



Table 3: MSE of various .R-estimators and of the spherical mean and spherical median. For the sake of clarity, 
the results have been multiplied by 10 3 . ^-estimators are computed by taking both the spherical median and 
the spherical mean as preliminary estimators. 
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Cosine of the angles with respect to the spherical mean 



o 




I 1 1 1 1 

0.92 0.94 0.96 0.98 1.00 

Cosines 



Figure 1: Histogram of the cosines X^Mean, ■ • ■ , X^^Mean of the Palaeozoic data. The red dotted line, the 
yellow dotted line, the magenta "overplotted" line and the green line are plots of the FVML densities with 
ft = 20, k — 50, ft = 69.544 and k = 200 respectively. The black line represents a Gaussian kernel density 
estimator. 
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We computed FVML-score based i?-estimators with k = 20, k = 50, k = 200 and k = 
69.544 (we took Me dian as preliminary estimator). We obtained d K=2 o = (.3152545, .4947636, -.8099027)', 
K=5O = (.3156331, .4949186, -.8096652)', d K=20 o = (.3156907, .4948861, -.8096624)' and0 K=69 .544 = 
(0.3156517,0.4949093, -0.8096636)'. The data is illustrated in Figure 2. Data points are rep- 
resented by blue circles. The red point is K =5o- 



Palaeozoic red-beds data 




Figure 2: The Palaeozoic data (blue circles) and 6 K= so (red point). The data consists of 26 measurements of 
magnetic remanence. 

Of course, one could argue that if the FVML specification is chosen, the spherical mean 
$Mean is the efficient estimator and, as a consequence, has to be used for the estimation of 0. 
We insist here on the fact that we do not specify a FMVL distribution but we rather choose 
the FVML family as a target. As shown in the ARE results (see Table 1), FVML-score based 
estimators are robust- efficient in the sense that if the underlying distribution is not a FVML 
one, they can be more efficient than the spherical mean. 

The empirical example provided in this section raises two important open questions which 
are beyond the scope of the present paper. First, is 0~ K (the i?-estimator based on a FVML 
score with an estimated concentration parameter) as efficient as #Mean under any FVML 
distribution, irrespective of the underlying concentration? Secondly, is it more efficient than 
the same spherical mean outside of the FVML case? The answer to these questions could be 
obtained by considering a location/scale model in which we would quantify the substitution 
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of the scale parameter by a root-ra consistent estimate. This problem is currently under 
investigation. 



A Proofs 

Proof of Proposition 2.1. Since the LAN property is stated with respect to r), the density 
function (2.1) is denoted by in this proof but remains the same function. Our proof relies 
on Lemma 1 of Swensen (1985)-more precisely, on its extension in Garel and Hallin (1995). 
The sufficient conditions for LAN in those results readily follow from standard arguments 

1/2 

(hence are left to the reader), once it is shown that r} i— >• (x) is differentiable in quadratic 
mean. 

In what follows, all o(\\ ■ ||) or 0(\\ ■ ||) quantities are taken as || • || — > 0. Denoting by 



/ 

Js h 



grad,,/,/ (x) := -/„' (x)ip fl (x'h(ri))Dh(r})'x 
the gradient of the square root of the density /»j(x), quadratic mean differentiability holds if 

° {4( 2 e (x) - /„ 1/2 (x) - e'grad,/, 1/2 (x)} 2 da(x) (A.l) 

is o(||e|| 2 ) for e G Obviously, since r\ h-> h(r)) is differentiable, we have that x.'h(rf + e) — 

x'h(r)) = x.'Dh(ri)e + o(||e||) for all x G S k ~ 1 . This implies that the integral (A.l) takes the 
form 

c, ;/l I {fl^m + x' J D%)e + (||e||)) - A 1/2 (x'%)) 
Js k - 1 L 

-i/ 1 1/2 (x'/ l (r ? ))^ /l (x'%))x^/ l ( J7 )e} 2 d ( T(x). (A.2) 

Now, since f 1 / 2 inherits absolute continuity from /, j 1 ! 2 is differentiable almost everywhere 
on [—1, 1]. Consequently, for almost all x and any perturbation s£l, 

A 1/2 (x'%) + s) - fl /2 ^h( n )) = l -fl /2 (*!h( n )).p h ^h( n ))s + o(s), 

so that, using the fact that sup xgi5 fc-i |x' Dh(r))e\ < C\\e\\ for some positive constant C, we 
have that 



sup 



A 1/2 (x'%) + x' J D%)e + (||e||)) - /^(x^)) ~ \fl' 2 h Hv))*' Dh(r,)e 
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is o(||e||) uniformly in x. Consequently, the integrand in (A. 2) is o(||e|| 2 ) uniformly in x. The 
result follows since J sk -i dcr(x) = 27r fc / 2 /r(|) < oo. 

□ 

Proof of Proposition 2.2. In this proof, we only provide the expressions of and Tfl.^ 

since ULAN directly follows from the combination of Lemma 2.1 and Proposition 2.1. 

From Lemma 2.1, we obtain that the family ^P^^ I 6 € is also ULAN with central 

sequence 



Ag a := n- 1 /2 jD r( 7? )(( jD t( 7? ) ) ' jD r(7 ? ))- 1 ( J Dr(7/))'^^ /l (X^)(l - (X^) 2 )V2 S ,(X,) 



1=1 

and Fisher information matrix 



V ^h = ^^Dt(r,)((Dt( V )yDt(r,)r\Dt(r,)y. 



Next, note that if 6 = T(r}) for some r) € M k ~ 1 such that Dt is full-rank in the vicinity of 
r), then, clearly, 6 = t{rf) = Oh(rj), with h defined as in (2.4), for some rotation O G SOk- 
After some easy computations, we obtain that (putting 6 = O'd = h(rf)) 

DmnDt^yDtm^iDtin))' = ODhWiiDhWYDhWr^DhWyO' 

= o{i k -¥e')o' 
= h -ee'. 

Wrapping up, we see that, combining Proposition 2.1 and Lemma 2.1, the family |Pg n ^ | £ 
(in the ^-parameterization) is ULAN with central sequence 



Ag = n- 1 / 2 (I fc -^)E^( X ^)( 1 -W) 2 ) 1/2s ^ X ^ 



» 

— n ■ (ifc - i/i/ ) 

i=i 

n 



i=l 



and Fisher information matrix 



□ 



Proof of Proposition 3.1. Part (i) follows easily from Hajek's classical result for linear 
signed-rank statistics (see Hajek and Sidak (1967)). Parts (ii) and (iii) are consequences of 
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part (i), the multivariate central limit theorem and Le Cam's third lemma. We therefore only 
prove in detail part (iv) of the Proposition. 

For the sake of simplicity, we let 6^ n ' = 6 + n _1//2 t be the perturbed spherical location. In 
the sequel, we put U 4 (n) := X, - (XjflW)flW, U° := X, - (X<0)0, S fl(n) (X i ) := /\\v\ n) \\ 
and Sfl(Xj) := U^/||U^||. We then have the following lemma. 

Lemma A.l. For all i £ {1, ... ,n}, we have that ||S fl ( n )(Xi) — Se(Xj)|| is op(l) under Pj?^ 
wi/i /i € J 7 as n — t- oo. 

Proof of Lemma A.l. First note that 

||uS n) -u°|| = ||(X J -(X^))^))-(X i -(X^)|| 

= ||(X0)0 - (Xj0)flW + (X0)0< n > - (X<tfW)flW|| 

< ||(X^)(0-0 (n) )|| + ||X;(0-0 (ri) )0 (n) ||. (A.3) 

Both terms in (A.3) are clearly op(l) as n — >• oo. Consequently, we have that 
||S,<„)(X<) - S (Xi)\\ < 2 IIUJU-HIIU^ - U?||) is o P (l) under P^ as n -> oo. 

□ 

Proof of part (iv). From part (i) we know that A^. — A^"^ = op(l) under Pg™^ as 

n — )• oo. Similarly, A[?\ - A a = op(1) under Pi?2-, as n — > oo. Hence, from 
~6j">;K B {n ';K, gi w { > ;gi 

contiguity, A ,L — Av, is also op(l) under Pi"' as n — )• oo. This entails that the 
claim holds if 

is op(l) under Pg^ as n — > oo. Consequently, the result follows if we can show that 
( a ) A< ei)- K , gi ~ A< £kgi ~ E ^ A %;K, g ) is ° L2 ^ under P £ as n ~* °°' and that 



(b) E [A^ ); ^ g J +r 0; ^ 5l tW is o(l) under as n ^ oo. 

We first prove (a). Using the fact that X^0 and Sg(Xj) are independent (as mentioned in the 

n) \ 



Introduction), we have (the expectation is taken under P^ 



E 



A (n) 



n 

5^E[iif(Gi(Xj«))]E[S ff (X i )]=0, 



i=i 



since, under Pg!^, the sign Sfl(Xj) is uniformly distributed on S k l {6^). Now, let D^ n ^ : 
n-V2 E n =i ( T W_ E[T WA where T (n) := ir( G 1 (X^W))S, ( „ ) (X i ) - iT(G 1 (X^))S e (X i 
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Clearly, D^ n ^ corresponds exactly to the expression in (a), since E[Ag n ^- 9 J = as explained 
above. Using i.i.d.-ness and standard algebra arguments, we have that, under Pg!^, 



E[||D( n )|| 2 ] 



n _1 E 



n 1 tr 



^ T (n)_ E[T 



i=l 



Var 



Wi 



i=l 



= E[||TS n) || 2 ]. 

Therefore, it remains to show that E[||T^|| 2 ] is o(l) as n — > oo. We have that 



-.("■) ||2 



KiC^e^)) - K(G\(yL' x Q)) \ S eW (X! 



+ir(G 1 (x' 1 0))(s fl(n) (x 1 )-s e (x 1 )) 



< 2 



\ S 0(") (Xi)|| 2 



+2 ^ 2 (G 1 (X^))||S 0( , 1) (X 1 ) - S fl (Xi 



The continuity of K o G\ together with the fact that X^fl — X^fl^ is op(l) (under ig gi as 
n -> oo) imply that K(Gi(X' 1 ^ (n) )) - J^(Gi(X^)) is o P (l) under as n -> oo. Since 

if is continuous on a compact support (and is therefore bounded, see Assumption D), this 
convergence also holds in quadratic mean. Similarly, the boundedness of S#(Xi) and S fl ( n ) (Xi) 
together with Lemma A.l yields the result of (a). 

We now turn to the proof of (b). Parts (i), (ii) and (iii) readily state that, under Pg™^ and 
for n — > oo, 

^K, 91 ^^k{O,T . K ), (A.4) 

In) 

and that under the sequence of local alternatives Pg(„) , as n — )• oo, 

Now, it follows from ULAN that the model is locally and asymptotically linear (see, e.g., 
Bickel (1982)). Therefore, Assumption (D), the continuity of Tg. Kgi and contiguity entail 



that, under P 



(n) 

0;<7i' 



An) 



ill 



+% 91 tW4Ar fc (o,r fi;Jf ) 



(A.5) 
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as n -> oo. We have shown in (a) that ~ A ; ir, gi ~ E IAj(») # ] is under P S 

as n — >■ oo. Therefore, in view of (A. 4), we have that 

Comparing (A. 5) and (A. 6), it follows that E[A^2) K ] ~\~'^0;K,g 1 ^ <yr ^ is o(l) as n — > oo under 
Pi"^ which is the desired result. 

Since we have proved both (a) and (b), the claim of part (iv) of the proposition holds, 
which concludes the proof. □ 
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